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(54) Packing method, recording medium and transmitting and receiving apparatus for variable 
length data 



(57) Audio data at the top of each packet is always 
set at the top of sample data and packets can be treated 
as units, thus facilitating a timing process, sequential 
processing and the like for processing audio data. A plu- 
rality of samples consisting of variable length data are 
arranged in a packet of predetermined bytes. In this 
case, other samples than the first sample in the plurality 
of samples are sequentially arranged with the top of the 

1 PACK (=1 PACKET) 2048 BYTES 



first sample being positioned at a predetermined posi- 
tion of the packet, the total byte length of the sample 
data is equal to or smaller than a maximum byte length 
of the packet, and when a byte length of the packet is 
less than the maximum byte length, invalid data of a 
stuffing byte or a padding byte is inserted in a remaining 
portion. 
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Description 

The present invention relates to a data arranging 
method and a medium for recording or transferring data 
or the like to be recorded on a digital video disk and a 
digital audio disk, and a signal processing apparatus for 
processing the data. 

Recently, digital video disks have been developed 
as optical disks in addition to conventional compact 
disks (hereinafter referred to as "CDs") for audio usage, 
and players for such digital video disks have also been 
developed. In particular, the digital video disks include a 
kind which is about the same size (12 cm in diameter) 
as the conventional CDs and is designed in such that 
about two hours of picture information can be recorded 
on and reproduced from that disk. For such a digital 
video disk, there is a format which allows voices or 
musics in eight different languages and superimposition 
information in thirty two different languages to be 
recorded on the same disk in addition to picture infor- 
mation. 

Again, digital video disks which can record voices 
or musics in multiple languages in addition to main pic- 
ture information and is the same in size with the conven- 
tional CDs have been developed. 

If such digital video disks become available on the 
market, naturally, it would be a natural demand to repro- 
duce pieces of music or voices (audio signals) from new 
digital video disks as well as from the conventional CDs. 
The recording systems for audio signals include a com- 
pression system and a linear PCM system. If one con- 
sider For a video disk from which audio signals of 
pieces of music and voices can be reproduced by an 
exclusive audio player, it is effective to record data by 
the linear PCM technique as used for the conventional 
CDs. If multiple kinds of data are recorded on a record- 
ing medium, it is necessary to synchronize reproduction 
timings of the data with one another. In such a case, it is 
desirable that the data can be handled easily. 

Accordingly, it is an object of the present invention 
to provide a packing method, a recording medium and a 
transmitting and receiving apparatus for variable length 
data, which facilitate the handling of audio data the 
number of whose channels and the number of whose 
quantization bits are set in multifarious ways. 

To achieve the above object, this invention is char- 
acterized in that in storing a packet having plural pieces 
of sample data of different data lengths in a pack of pre- 
determined bytes in accordance with the number of 
channels, the plural pieces of sample data are sequen- 
tially arranged with a top of the sample data being posi- 
tioned at a predetermined position of the packet, a total 
byte length of the sample data is equal to or smaller 
than a maximum byte length of the packet, and when a 
byte length of the packet is less than the maximum byte 
length, invalid data of a stuffing byte or a padding byte is 
inserted in a remaining portion. 

With the use of such a packing method, audio data 
at the top of each packet is always the top of sample 



data and packets can be treated as units. This facilitates 
a timing process, sequential processing and the like for 
processing audio data. 

This invention can be more fully understood from 
5 the following detailed description when taken in con- 
junction with the accompanying drawings, in which: 

FIGS. 1A through 1D are explanatory diagrams 
showing a sample structure and the arrangement of 
10 samples for explaining a basic embodiment of this 
invention; 

FIGS. 2A and 2B are explanatory diagrams show- 
ing an example of the arrangement of packs and 
the structure of an audio pack in this arrangement 

15 according to this invention; 

FIG. 3 is an explanatory diagram depicting the 
detailed structure of an audio pack; 
FIG. 4 is an explanatory diagram exemplifying a list 
of sizes of linear PCM data in a packet, to which this 

20 invention is adapted; 

FIG. 5 is an explanatory diagram illustrating proce- 
dures of generating an audio pack; 
FIG. 6 is a block structural diagram of a disk playing 
apparatus; 

25 FIG. 7 is an explanatory diagram of a disk drive 
section; 

FIG. 8 is an explanatory diagram of an optical disk; 
FIG. 9 is an explanatory diagram illustrating the log- 
ical format of an optical disk; 
30 FIG. 10 is an explanatory diagram of a video man- 
ager in FIG. 9; 

FIG. 11 is an explanatory diagram of a video object 
set in FIG. 8; 

FIG. 12 is an explanatory diagram of a program 
35 chain; 

FIG. 1 3 is a diagram showing one example of the 
basic circuit structure of an audio decoder accord- 
ing to this invention; 

FIG. 14 is a diagram showing another example of 
40 the basic circuit structure of the audio decoder 
according to this invention; 

FIG. 15 is a diagram showing a further example of 
the basic circuit structure of the audio decoder 
according to this invention; 
45 FIG. 1 6 is a diagram showing a still further example 
of the basic circuit structure of the audio decoder 
according to this invention; 

FIG. 17 is a table representing the contents of the 
pack header of the audio pack; 
so FIG. 18 is a table illustrating the contents of the 
packet header of the audio pack; 
FIG. 19 is a block diagram showing mainly the 
audio data processing system incorporated in the 
disk playing apparatus; 
55 FIGS. 20A to 20D are diagrams showing a disk, a 
pit train, a sector train, and a physical sector, 
respectively; 

FIGS. 21 A and 21 B are respectively a diagram 
showing a physical sector and a table representing 
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the contents of the physical sector; 
FIGS. 22A and 22B are diagrams showing the 
structure of a recording/recorded sector; and 
FIGS. 23A and 23B are diagrams illustrating an 
error-correction code block. 

Preferred embodiments of the present invention will 
now be described with reference to the accompanying 
drawings. 

To begin with, a data arrangement by the linear 
PCM system in the data recording system according to 
this invention will be discussed. Note that 1 6 bits. 20 bits 
or 24 bits, for example, are arbitrarily used as quantiza- 
tion bits in linear PCM data. Further, audio modes 
include monaural, stereo, 3 channels, 4 channels, 5 
channels, 6 channels, 7 channels and 8 channels 
modes. 

Suppose that there are eight channels (A to H) of 
audio signals. Those audio signals are sampled at a 
.sampling frequency of 48 KHz or 96 KHz to be quan- 
tized. The following will describe an example where the 
quantization bits are 20 bits. 

FIG. 1 A shows how eight channels of audio signals 
A to H are sampled. It is assumed that each sample be 
quantized to, for example, 20 bits. It is also illustrated 
that each sample of 20 bits is separated to a main word 
and an extra word. 

The main words of the individual channels are indi- 
cated by large alphabet letters plus a suffix "n," and the 
extra words by small alphabet letters plus the suffix "n" 
where n (= 0, 1, 2, 3, ...) indicates the sampling order. 
Each main word consists of 16 bits and each extra word 
consists of 4 bits. 

Individual samples are generated in the form of 
AOaO, A1a1 , A2a2, A3a3, A4a4 and so forth for the sig- 
nal A, BObO, B1b1, B2b2, B3b3, B4b4 and so forth for 
the signal B, COcO, C1c1, C2c2, C3c3, C4c4 and so 
forth for the signal C, HOhO, H1h1, H2h2, H3h3, 
H4h4 and so forth for the signal H. 

FIG. 1 B illustrates the above word arrangement for- 
mat as a sequence of samples in the case where those 
words are recorded on a recording medium. 

First, each sample data consisting of 20 (= M) bits 
is separated to a main word of 16 (= ml) bits on the 
MSB side and an extra word of 4 (= m2) bits on the LSB 
side. Next, the zero-th (= 2n-th) main words in the indi- 
vidual channels are collectively arranged. Then, the first 
(= (2n+1)-th) main words in the individual channels are 
collectively arranged. Then, the zero-th (= 2n-th) extra 
words in the individual channels are collectively 
arranged. Then, the first (= (2n+1)-th) extra words in the 
individual channels are collectively arranged. Note that 
n =0, 1, 2, .... 

A group of main words in the individual channels is 
one main sample. Likewise, a group of extra words in 
the individual channels is one extra sample. 

With such a format employed, a data reproduction 
process by a low-cost machine (e.g., the one which 
operates in a 16-bit mode) should handle only main 



words, while a data reproduction process by a high-cost 
machine (e. g ., the one which operates i n a 20-bit mode) 
should handle both main words and their associated 
extra words. 

5 FIG. 1C shows how individual samples are 

arranged by using the specific numbers of bits for the 
main sample and extra sample. 

In the form of such quantized linear PCM codes, the 
separation of a 20-bit sample to a 16-bit main word and 

10 a 4-bit extra word can permit the following. The machine 
which operates in the 16-bit mode can easily discard 
unnecessary portions by performing data processing in 
the units of 8 bits in the areas of extra samples in the 
sample arrangement. This is because two extra sam- 

15 pies are 4 bits x 8 channels and 4 bits x 8 channels, and 
those data can be processed (discarded) eight consec- 
utive times in the units of 8 bits. 

The feature of this data arrangement is not limited 
to that of this embodiment. In either case where there 

20 are an odd number of channels, or where an extra word 
consists of 8 bits, the total number of bits of two consec- 
utive extra samples is an integer multiple of 8 bits, so 
that a low-cost machine which reproduces only main 
words can skip extra samples by executing a discarding 

25 process n consecutive times 8 bits by 8 bits in accord- 
ance with the mode. 

Data in the state in FIG. 1 B may then be subjected 
to a modulation process to be recorded on a recording 
medium. If data is to be recorded together with other 

30 control information and video information, it is prefera- 
ble that data should be recorded in the form that is eas- 
ily managed on the time base in order to facilitate data 
handling and synchronization. In this respect, the fol- 
lowing frame formation, grouping and packet formation. 

35 FIG. 1 D shows a sequence of audio frames. The 
unit of data over a given reproduction time is 1/600 sec 
which is one frame. In one frame, 80 or 160 samples are 
assigned. With a sampling frequency of 48 KHz, one 
sample is 1/4800 sec and (1/48000) x 80 samples = 

40 1/600 sec. With a sampling frequency of 96 KHz, one 
sample is 1/9600 sec and (1/96000) x 160 samples = 
1/600 sec. Obviously, one frame consists of 80 samples 
or 160 samples. 

FIG. 2A shows the relationship between the packet 

45 and frame. 

DSI is data search information, V is a video object, 
A is an audio object and S is a sub picture object. Each 
block is called a pack. One pack is defined to be 2048 
bytes. One pack includes one packet, and consists of a 

so pack header, a packet header and a packet. Described 
in DSI is information for controlling each data at the 
playback time, such as the start address and end 
address of each pack. FIG. 2B shows only audio packs 
extracted. Although DSI packs, video packs V and audio 

55 packs A are actually mixed in the arrangement as 
shown in FIG. 3A, only audio packs A are illustrated in 
FIG. 2B to help understand the relationship between a 
frame and packs. According to the standards of this sys- 
tem, information is so arranged that it takes about 0.5 
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sec to reproduce information between a DSI and the 
next DSL As one frame is 1/600 sec as mentioned 
above. 30 audio frames exist between one DSI and 
another DSI. The amount of data (D) of one frame var- 
ies depending on the sampling frequency (fs), the 5 
number of channels (N) and the number of quantization 
bits (m). 

When fs = 48 KHz, D = 80 x N x m , and when fs = 
96 KHz, D = 160 x N x m . 

Therefore, one frame should not necessarily corre- 10 
spond to one pack; a plurality of frames or a less than 
one frame may correspond to one pack. That is, the 
head of a frame may come in a middle of one pack as 
shown in FIG. 3B. Positional information of the head of 
a frame is described in the pack header, and is 15 
described as the number of data counts (timings) from 
the pack header or DSI. In reproducing data from the 
aforementioned recording medium, the reproducing 
apparatus acquires a frame of audio packets, extracts 
data of a channel to be reproduced, and supplies the 20 
data to the audio decoder to perform a decoding proc- 
ess. 

FIG. 3 shows an audio pack in enlargement. 
Arranged in the data section of this audio pack are twin 
pairs of samples with the top twin pair of samples (AO- 25 
HO, A1-H1) located at the head of the data area. The 
number of bytes in one pack is fixed to 2046 bytes. As 
samples are variable length data, 2048 bytes should not 
necessarily be equal to an integer multiple of the byte 
length of twin pairs of samples. Therefore, there may be 30 
a case where the maximum byte length of one pack dif- 
fers from the byte length of (a twin pair of samples x 
integer number). In this case, the byte length of a pack 
is set to become the pack s byte length ^ (a twin pair of 
samples x integer number). If a part of a pack remains, 35 
a stuffing byte is inserted in the pack header when the 
remainder is equal to or less than 7 bytes while a pad- 
ding packet is inserted at the end of the pack when the 
remainder exceeds 7 bytes. 

Audio information in this pack format can easily be 40 
handled at the time of reproduction. 

As the top audio data in each pack is the top twin 
pair of samples or main samples, the reproduction proc- 
ess becomes easier when reproduction is executed at 
the proper timing. This is because the reproduction 45 
apparatus acquires data and performs data processing 
pack by pack. If samples of audio data are located over 
two packs, the two packs should be acquired and audio 
data after integration should be decoded. This compli- 
cates the processing. When the top audio data in each so 
pack is always the top twin pair of samples and audio 
data is grouped pack by pack as in this invention, timing 
should be taken only for one pack, thus facilitating the 
data processing. Further, the packet-by-packet data 
processing simplifies the authoring system (aiding sys- 55 
tern), which can simplify software for processing data. 

At the time of special reproduction or the like, par- 
ticularly, video data may be subjected to thinning or 
interpolation. In such a case, since audio data is permit- 



ted to be handled packet by packet, it is possible to rel- 
atively easily control the reproduction timing. Further, 
software for the decoders need not be complicated. 

Although samples are generated with each sample 
separated into the upper 16 bits and the lower 4 bits in 
the above-described system, data should not necessar- 
ily take such a format as long as linear PCM audio data 
is sampled. 

With the data length of an extra sample being 0, for 
example, a train of data becomes a sequence of main 
samples which is the general data format. In this case, 
no extra samples are present, so that there is no need 
to generate twin pairs of samples and main samples 
alone should be formed into packets. 

FIG. 4 shows a list of the sizes of linear PCM data 
when linear PCM data are arranged in a packet in the 
units of twin pairs of samples as discussed above. The 
data sizes are shown as the number of maximum sam- 
ples to be fitted in one pack, separately for the monaural 
(mono), stereo and multichannel modes. Each group 
shows the data sizes for the respective numbers of 
quantization bits. Because of twin pairs of samples 
taken as units, every number of samples in one packet 
is an even number. As the number of channels 
increases, the number of bytes increases accordingly, 
so that the number of samples in one packet decreases. 
When the number of quantization bits is 16 bits and the 
mode is the monaural mode, the number of samples in 
one packet is 1004, and the number of bytes is 2008 
with the stuffing byte of 5 bytes, which indicates that 
there is no padding bytes. Note however that the first 
packet has the stuffing bytes of 2 bytes. This is because 
3-byte attribute information may be affixed to the header 
of the first packet. 

With the number of quantization bits being 24 bits 
and in the stereo mode, stuffing of 6 bytes is given to the 
top packet and padding of 9 bytes is given to the subse- 
quent packets. 

FIG. 5 illustrates the operational procedures of the 
apparatus which generates packs. 

Suppose that audio signals of each channel are 
samples to produce the samples as shown in FIG. 1B, 
which are stored in the memory. In step S11, the sam- 
ples are acquired one by one. In step S12, it is deter- 
mined if the number of bytes has reached the capacity 
of a packet (2020 bytes). When it has reached 2010 
bytes, those samples up to that sample are packed 
(step S13). 

When the number of bytes has not reached the 
capacity of a packet (2020 bytes), the flow proceeds to 
step S14 where it is determined if the number of bytes 
of the acquired samples exceeds 2010 bytes. When it 
does not exceed 2010 bytes, the flow returns to step 
S11. When it exceeds 2010 bytes, on the other hand, 
the last acquired sample is returned to the position of 
step S1 1 and the difference between the number of 
remaining bytes and 2010 bytes is computed in step 
S15. It is then determined if this difference R exceeds 8 
bytes (step S16). When the difference R exceeds 8 
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bytes, padding is performed (step S17) to construct a 
packet, whereas when the difference R is equal to less 
than 7 bytes, stuffing is performed (step S18) to con- 
struct a packet. 

The reproduction apparatus which reproduces the 5 
above-discussed data will be briefly described. 

FIG. 6 shows an optical disk player, FIG. 7 shows 
the basic structure of a disk drive section 501 which 
drives an optical disk 10 on which the above-described 
audio stream is recorded, and FIG. 8 presents a dia- 10 
gram for explaining an example of the structure of the 
optical disk 10. 

The optical disk player in FIG. 6 will now be dis- 
cussed. 

The optical disk player has a key operation/display is 
section 500. The optical disk player is connected to a 
monitor 1 1 and speakers 1 2. Data picked up from the 
optical disk 10 is sent via the disk drive section 501 to a 
system processing section 504. The picked-up data 
from the optical disk 10 includes picture data, sub pic- 20 
ture data and audio data, for example, which are sepa- 
rated in the system processing section 504. The 
separated picture data is supplied via a video buffer 506 
to a video decoder 508 , the sub picture data is supplied 
via a sub picture buffer 507 to a sub picture decoder 25 
509. and the audio data is supplied via an audio buffer 
507 to an audio decoder 513. The picture signal 
decoded by the video decoder 508 and the sub picture 
signal decoded by the sub picture decoder 509 are 
combined by a synthesizing section 510, and the result- 30 
ant signal is converted to an analog picture signal by a 
D/A converter 51 1 . This analog picture signal is then 
sent to the monitor 1 1 . The audio signal decoded by the 
audio decoder 513 is converted by a D/A converter 514 
to an analog audio signal which is in turn supplied to the 35 
speakers 12. 

The entire player is controlled by a system CPU 
502. That is, the system CPU 502 can exchange control 
signals, timing signals and the like with the disk drive 
section 501 , the system processing section 504 and the 40 
key operation/display section 500. Connected to the 
system CPU 502 is a system ROM/RAM 503 in which 
fixed programs for allowing the system CPU 502 to exe- 
cute data processing are stored. Management data or 
the like which is reproduced from the optical disk 1 0 can 45 
also be stored in the system ROM/RAM 503. 

A data RAM 505, connected to the system process- 
ing section 504, is used as a buffer when the aforemen- 
tioned data separation, error correction or the like is 
executed. so 

The disk drive section 501 in FIG. 7 will now be dis- 
cussed. 

A disk motor driver 531 drives a spindle motor 532. 
As the spindle motor 532 rotates, the optical disk 10 
turns and data recorded on the optical disk 10 can be 55 
picked up by an optical head section 533. The signal 
picked up by the optical head section 533 is sent to a 
head amplifier 534 whose output is input to the system 
processing section 504. 



A feed motor 535 is driven by a feed motor driver 
536. The feed motor 535 drives the optical head section 
533 in the radial direction of the optical disk 10. The 
optical head section 533 is provided with a focus mech- 
anism and a tracking mechanism to which drive signals 
from a focus circuit 537 and a tracking circuit 538 are 
respectively supplied. 

Control signals are input to the disk motor driver 
531 , the feed motor driver 536, the focus circuit 537 and 
the tracking circuit 538 from a servo processor 539. 
Accordingly, the disk motor 532 controls the rotation of 
the optical disk 10 in such a way that the frequency of 
the picked-up signal becomes a predetermined fre- 
quency, the focus circuit 537 controls the focus mecha- 
nism of the optical system in such a way that the optical 
beam from the optical head section 533 forms the opti- 
mal focal point on the optical disk 10, and the tracking 
circuit 538 controls the tracking mechanism in such a 
way that the optical beam hits the center of the desired 
recording track. 

The structure of the optical disk 10 shown in FIG. 8 
will now be explained. 

The optical disk 10 has information recording areas 
22 around clamp areas 21 on both sides. The informa- 
tion recording area 22 has a lead-out area 23 where no 
information is recorded at the outer periphery, and a 
lead-in area 24 where no information is recorded at the 
boundary with the associated clamp area 21 . Between 
the lead-out area 23 and the lead-in area 24 lies a data 
recording area 25. 

Tracks are continuously formed in the data record- 
ing area 25 in a spiral form. The tracks are separated 
into a plurality of physical sectors which are given serial 
numbers. Signal spots on tracks are formed as pits. For 
read-only optical disk, a sequence of pits is formed on a 
transparent substrate by a stamper, and a reflection film 
is formed on the pits-formed surface to form a recording 
layer. A double-disk type optical disk has two disks 
adhered together via an adhesive layer, yielding a com- 
posite disk, in such a manner that those recording lay- 
ers face each other. 

The logical format of the optical disk 10 will now be 
discussed. 

FIG. 9 shows the logical format of the information 
sections of the information recording area 25. This logi- 
cal format is determined in conformity to specific stand- 
ards, such as micro UDF and ISO 9660. In the following 
description a logical address means a logical sector 
number (LSN) which is determined by the micro UDF 
and ISO 9660, and logical sectors are the same in size 
as the aforementioned physical sectors, each logical 
sector consisting of 2048 bytes. It is assumed that serial 
logical sector numbers (LSN) are given to the logical 
sectors in the ascending order of the physical sector 
numbers. 

The logical format is a hierarchical structure and 
has a volume and file structure area 70, a video man- 
ager 71 , at least one video title set 72 and other record- 
ing area 73. Those areas are differentiated at the 
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boundaries of the logical sectors. As mentioned above, 
the size of one logical sector is 2048 bytes. The size of 
one logic block is also 2048 bytes, so that one logical 
sector is defined as one logic block. 

The file structure area 70 is equivalent to a man- 
agement area which is defined by the micro UDF and 
ISO 9660, and data in the video manager 71 is stored in 
the system ROM/RAM section 52 via the description in 
this area 70. Information for managing the video title 
sets is described in the video manager 71, which con- 
sists of a plurality of files 74 starting with a file #0. 
Recorded in each video title set 72 are compressed 
video data, sub picture data, audio data and playback 
control information for reproducing those data. Each 
video title set 72 consists of a plurality of files 74, which 
are also differentiated at the boundaries of the logical 
sectors. 

Recorded in the other recording area 73 is informa- 
tion which is used when the information in the video title 
set is used or information which is exclusively used. 

The video manager 71 will be described below with 
reference to FIG. 10. 

The video manager 71 consists of video manager 
information (VMGI) 75, a video object set for a video 
manager information menu (VMGM_VOBS) 76 and a 
backup of video manager information (VMGI_BUP) 77. 

Stored in the VMGM_VOBS 76 are video data, 
audio data and sub picture data for the menu which is 
associated with the volume of the optical disk. The 
VMGM_VOBS 76 can provide descriptive information, 
given by voices and a sub picture in association with 
each of titles in the volume, and the selection display for 
the titles. When English conversations for learning Eng- 
lish are recorded on the optical disk, for example, the 
title name of each English conversation and examples 
of a lesson are reproduced and displayed while a theme 
song is acoustically reproduced, and each sub picture 
shows which text of which level or the like. The lesson 
numbers (levels) are displayed as selection items which 
should be selected by a listener. The VMGM_VOBS 76 
is used for such a usage. 

FIG. 1 1 exemplifies a video object set (VOBS) 82. 

There are two types of video object sets for a menu 
and one type of video object set for video titles, those 
three types having similar structures. 

The VOBS 82 is defined as a set of one or more 
video objects (VOB's) 83, which are used for the same 
purpose. Normally, the VOBS for a menu consists of 
video objects (VOB's) for displaying a plurality of menu 
screens, while the VOBS for a video title set consists of 
VOB's for displaying normal moving pictures or the like. 

Each VOB is given an ID number (VOB_IDN#j), 
which is used to identify that VOB. One VOB consists of 
one cell or a plurality of cells 84. Likewise, each cell is 
given an ID number (CJDN#), which is used to identify 
that cell. The video object for a menu may be comprised 
of a single cell. 

Further, one cell consists of one or a plurality of 
video object units (VOBU's). A single VOBU is defined 



as a sequence of packs having a navigation pack (NV 
pack) at the top. One VOBU is defied as a set of all 
packs recorded between the NV pack (including the 
aforementioned DSI) and the next NV pack. 

5 The playback time for the VOBU is equivalent to the 
playback time for video data which consists of a single 
GOP (Group Of Picture) or a plurality of GOP's included 
in this VOBU, and is defined to be equal to or greater 
than approximately 0.4 sec and equal to or less than 1 

10 sec. The MPEG standards define one GOP as com- 
pressed image data equivalent to the playback time of 
about 0.5 sec. According to the MPEG standards, there- 
fore, about 0.5 sec of audio information and picture 
information can be arranged. 

is One VOBU has the aforementioned NV pack at the 
top, followed by video packs (V packs), sub picture 
packs (SP packs) and audio packs (A packs) arranged 
in a certain order. A plurality of V packs in one VOBU 
has compressed image data whose playback time is 

20 equal to or less than 1 sec, in the form of one GOP or a 
plurality of GOP's. Audio signals corresponding to this 
playback time are compressed and arranged as A 
packs. The sub picture data used within this playback 
time is compressed and is arranged as SP packs. It is to 

25 be noted that audio signals are recorded with, for exam- 
ple, eight streams of data as a pack, and sub pictures 
are recorded with, for example, thirty two streams of 
data as a pack. 

One stream of audio signals is data encoded by 

30 one kind of coding system, and consists of eight chan- 
nels of linear PCM quantized data of 20 bits, for exam- 
ple. 

Returning to FIG. 14, the VMGI 75 describes infor- 
mation for searching for a video title, and includes at 
35 least three tables 78, 79 and 80. 

A video manager information management table 
(VMGLMAT) 78 describes the size of the VMG 71 , the 
start address of each information in the video manager, 
attribute information associated with the video object 
40 set for a video manager menu (VMGM_VOBS), and the 
like. 

A title search pointer table (TT_SRPT) 79 describes 
entry program chains (EPGC) of the video titles 
included in the volume of the optical disk which are 
45 selectable in accordance with the title number input 
through the key operation/display section of the appara- 
tus. 

The program chains will now be discussed referring 
to FIG. 12. Each program chain 87 is a set of program 

so numbers for reproducing the story of a certain title, and 
a chapter of the story of one title or the story itself is 
completed as program chains are continuously repro- 
duced. One program number consists of a plurality of 
cell ID numbers each of which can specify a cell in the 

55 VOBS. 

A video title set attribute table (VTS_ART) 80 
describes attribute information which is determined by 
video title sets (VTS') in the volume of the optical disk. 
The attribute information includes the number of VTS', 
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the number, the video compression system, the audio 
coding mode, and the display type of sub pictures. 

According to the packet system according to this 
invention, as described above, audio data at the top of 
each packet is always the atop of sample data, and 
packets can be treated as units, so that the timing 
processing for processing audio data and a sequence of 
processes of this timing processing becomes easier. 

A description will now be given of the audio decoder 
which reproduces data that is arranged and recorded in 
the above-described form. 

FIG. 13 shows the basic structure of the audio 
decoder 513. 

The illustrated decoder can reproduce data in all 
the modes for the numbers of channels and the num- 
bers of bits of samples, shown in FIG. 4. Input data is 
such that the number of quantization bits of every one of 
eight channels is 24 bits. 

A sequence of samples as discussed with refer- 
ence to FIG. 1 is continuously input to an input terminal 
710. This sequence of samples is given to the input ter- 
minal, 71 1 , of a switch SWO. The switch SW0 has distri- 
bution terminals for the individual samples of channels 
An to Hn and an to hn. The terminals which are associ- 
ated with samples of the individual channels are given 
the same reference numerals as representative sam- 
ples. The representative samples are samples AO to 
HO, A1 to H1, aO to hO and a1 to hi . 

It is assumed that the terminals AO to HO and A1 to 
H1 are 16-bit terminals, and the terminals aO to hO and 
a1 to hi are 4-bit terminals. The extra sample may con- 
sist a total of eight bits, so that two sets of 4-brt termi- 
nals, aO to hO and a1 to hi, are prepared. The 16-bit 
terminal AO is connected to the upper bits (1 6 bits) of a 
memory MAO. and the associated 4-bit terminals aO and 
aO are connected to the lower bits (8 bits) of the memory 
MAO via respective switches j1 and j2. The 16-bit termi- 
nal BO is connected via a switch JB to the upper bits of 
a memory MBO, and the associated 4-bit terminals bO 
and bO are connected to the lower bits of the memory 
MBO via respective switches j1 and j2. The 16-bit termi- 
nal CO is connected via a switch JC to the upper bits of 
a memory MCO, and the associated 4-bit terminals cO 
and cO are connected to the lower bits of the memory 
MCO via respective switches j1 and j2. Likewise, the 
other terminals DO to HO, D1 to H1, dO to hO and d1 to 
hi are connected to associated memories MDO to MH1 . 

As a result, the individual channels are distributed 
to the memories MAO to MH1. The output terminals of 
the memories MAO and MA1 are connected to terminals 
TAO, TaO, TaO, TA1 , Ta1 and Ta1 of an A channel output 
switch SWA. TAO and TA1 are 16-bit terminals, and TaO, 
TaO, Ta1 and Ta1 are 4-bit terminals. Likewise, the out- 
put terminals of the memories MBO and MB1 are con- 
nected to terminals TBO, TbO, TbO, TB1 , Tb1 and Tb1 of 
a B channel output switch SWB. TBO and TB1 are 16-bit 
terminals, and TbO, TbO, Tb1 and Tb1 are 4-bit termi- 
nals. The output terminals of the other memories are 
likewise connected to the associated output switches. 



The operation of the audio decoder 51 3 will now be 
discussed. 

Samples SO, S1 , eO, e1 which are arranged for 

the recording/transfer purpose and are to be input to the 

5 switch SWO can be expressed as AO, BO HO, A1 , B1 , 

.., H1, aO, bO hO, a1, b1 hO as samples of the 

individual channels. Each of main words of each chan- 
nel consists of 16 bits, and each extra word consists of 
8 bits. Suppose that the switches of the circuit are all 

10 closed. As the rotary switch SWO is sequentially 
switched from the topmost contact, associated samples 
are transferred to the memories MAO to MH1. In this 
manner, twin pairs of samples are cyclically stored in 
the memories MAO to MH1 by the action of the rotary 

75 switch SWO. Thereafter, samples of the desired channel 
among those samples stored in the memories MAO to 
MH1 are read via the associated rotary switch. The 
main sample and extra sample in each read sample are 
decoded and then combined for the subsequent 

20 processing. 

Let us pay attention to the reading of the channel A. 
With the rotary switch SWA at the topmost 16-bit con- 
tact position, the 16-bit sample AO is read. Then, sam- 
ples aO having a total of 8 bits are read at two 4-bit 

25 contact positions. At the next 1 6-bit contact position, the 
16-bit sample A1 is read. Then, samples a1 having a 
total of 8 bits are read at two 4-bit contact positions. As 
the rotary switch SWA rotates once, twin pairs of sam- 
ples AO, aO and A1 , a1 of the channel A are read out. In 

30 this manner, twin pairs of samples of the channel A are 
obtained in a time sequential form. Thereafter, with 
regard to the other channels B, C and so forth, samples 
are likewise read. Because twin pairs of samples are 
processed as each of the rotary switches SWO, SWA, 

35 .... and SWH makes one turn, the rotational period 
should be a half of the sampling frequency (fs/2). 

FIG. 14 illustrates another embodiment of the audio 
decoder. 

The illustrated embodiment processes data in the 

40 case where there are two channels and the number of 
quantization bits of each sample is 20 bits. This circuit 
differs from the one shown in FIG. 17 in the statuses of 
the switches JB^JH, j1 and j2. Therefore, same refer- 
ence numerals are given to those components which 

45 are the same as the corresponding components of the 
circuit in FIG. 13. 

Samples SO, S1 , eO, e1 and so forth are expressed 
as AO, BO, A1, B1, aO, bO, a1, b1 and so forth as a 
sequence of samples of the individual channels. Each 

so main sample of each channel consists of 16 bits, and 
each extra sample consists of 8 bits. 

As illustrated, only the switch JB is closed, and the 
switches JC to JH are open. With regard to those 
switches j1 and j2 which are associated with the extra 

55 samples aO, bO, a1 and b1. as illustrated, only the 
switches j1 are closed and the other switches are open. 
Those switches j1 and j2 which are associated with the 

other extra samples cO hO, c1 hi are all open. 

When the rotary switch SWO distributes input data 
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in synchronism with the data input, data to be trans- 
ferred are AO, BO, A1, B1 , aO (4 bits), bO (4 bits), a1 (4 
bits) and b1 (4 bits). The action of the rotary switch SWO 
allows the samples to be input to only the memories 
MAO, MBO, MA1 and MB1 in the illustrated order. 

On the output side, outputs are obtained from those 
of the memories MAO to MH1 which are associated with 
the channels A and B are acquired. Data 0 is output 
from the memories associated with the other channels. 
Of the switches j1 and j2 on the reading side, the 
switches j1 are closed and the switches j2 are open. 
Accordingly, a 4-bit extra sample is read out following a 
16-bit main sample. As regards the channel A, as the 
switch SWA is switched, data of the channel A is 
sequentially output in the order of AO, aO (4 bits), A1 and 
a1 (4 bits). 

The settings of the individual switches and the 
switching operations in the above-described embodi- 
ment are programmably set in accordance with the 
number of channels of audio streams and the number of 
quantization bits of each sample. Such a signal 
processing mode is described in the video title set 
attribute table shown in FIG. 10 and the packet header 
shown in FIG. 3. In other words, audio data included in 
an audio packet being linear PCM data, the audio frame 
number, the number of quantization bits, the sampling 
frequency, the audio channel number, etc. are 
described. 

The decoders illustrated in FIGS. 13 and 14 can 
cope with all the modes and are so-called full decoders 
that are adaptable in a high-class machine which can 
reproduce all the channels. 

The concept of this invention relates to a data 
arranging method, a recording/reproducing method and 
a processing apparatus, which can cope with various 
kinds of modes established by multifarious combina- 
tions of the number of channels and the number of 
quantization bits. The data arrangement can be 
adapted to the aforementioned high-class machine as 
well as a low-class machine which meets the demand 
for a lower cost, e.g., the one which reproduces only 16- 
bit data of two channels in every mode. Such a machine 
advantageously has a smaller circuit scale than the 
high-class machine. 

Although the switches which are used to distribute 
individual samples and acquire samples from the asso- 
ciated memories are illustrated as mechanical switches, 
they are all constituted of electronic circuits. 

An audio decoder in a low-class player will now be 
described. This audio decoder processes 16-bit data of 
only the channels A and B. Input samples are of eight 
channels and the number of quantization bits is 24 bits. 

A sequence of samples as discussed with refer- 
ence to FIGS. 1 A to 1 D is continuously input to an input 
terminal 810 in FIG. 15. This sequence of samples is 
given to the input terminal, 811, of a switch SWO. The 
switch SWO has distribution terminals for the individual 
samples of channels An to Hn and an to hn. The termi- 
nals which are associated with samples of the individual 
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channels are given the same reference numerals as 
representative samples, which are samples AO to HO, 
A1 to H1 , aO to hO and a1 to hi. 

It is assumed that the terminals AO to HO and A1 to 

5 H1 are 16-bit terminals, and the terminals aO to hO and 
a1 to hi are 4-bit terminals. Since the extra sample may 
consist a total of eight bits, two sets of 4-bit terminals, 
aO to hO and a1 to hi , are prepared. 

In this decoder, however, only the terminals AO and 

10 A1 , and BO and B1 are respectively connected to the 
memories MA and MB, with the other terminals C0-H0 
and cO-hO grounded. The switch SWO may be designed 
in this manner, or may be designed to have only those 
systems associated with the channels A and B from the 

75 beginning. 

The switches SWA and SWB are for reading data 
from the memories MA and MB in the units of 16 bits. 
Those switches SWA and SWB operate in such a way 
that output data are matched with one another. 

20 The operation of this audio decoder will now be dis- 
cussed. 

Samples SO, S1, eO, e1, .... which are arranged for 
the recording/transfer purpose and are to be input to the 
switch SWO can be expressed as AO, BO, HO, A1 , B1 , 

25 ... H1, aO, bO hO, a1. b1, .... hO as samples of the 

individual channels. Each main sample of each channel 
consists of 16 bits, and each extra word consists of 8 
bits. The switches of the circuit are all closed. As the 
rotary switch SWO is sequentially switched from the top- 

30 most contact, associated samples are transferred to the 
memories MAO and MB1 . The other samples are all dis- 
carded. 

Thereafter, the samples stored in the memories 
MAO and MB1 are read those of the channels A and B. 

35 Because two samples are processed as the rotary 
switch SWO turns once, the rotational period should be 
a half of the sampling frequency fs. Because one sam- 
ple is read as each of the rotary switches SWA and 
SWB turns once, the frequency is fs. 

40 Another audio decoder in a low-class player will 
now be discussed. This audio decoder processes 16-bit 
data of only the channels A and B. Input samples are of 
two channels and the number of quantization bits is 20 
bits. 

45 A sequence of samples as discussed with refer- 
ence to FIG. 1 is continuously input to the input terminal 
810 in FIG. 16. This sequence of samples is given to the 
input terminal 81 1 of the switch SWO. The switch SWO 
has distribution terminals for the individual samples of 

so channels An to Hn and an to hn. The terminals which 
are associated with samples of the individual channels 
are given the same reference numerals as representa- 
tive samples, which are samples AO, BO, A1 , B1 , aO, bO, 
a1 and b1. 

55 The terminals AO, BO, A1 and B1 are 16-bit termi- 
nals, and the terminals aO, bO, al and b1 are 4-bit termi- 
nals. To cope with the modes for two channels and the 
quantization bits of 20 bits, only the switch JB may is 
closed and the switches JC-JH are open. Those 
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switches j1 and j2 which are associated with the termi- 
nals aO, bO, a1 and b1 are closed and switches j3-j16 
associated with the other terminals are open. 

As the rotary switch SWO is sequentially switched in 
the above situation, no data transfer is performed. And 
only the main samples AO, BO, A1 and B1 are trans- 
ferred to the memories MA and MB. Regarding the extra 
samples aO, bO, a1 and b1, since their associated 
switches are grounded, those extra samples are dis- 
carded. The operation of reading samples from the 
memories MA and MB is carried out in the same man- 
ner as done in the previously described embodiment. 

Although the foregoing description of the low-class 
machine has been given with reference to two modes, 
data of two channels can be acquired in every mode in 
accordance with the selective open or closed states of 
the switches. The particular point that should be noted 
is that processing for extra samples is executed 8 bits by 
8 bits. The above-described data arrangement makes 
the number of bits of one pair of extra samples an inte- 
ger multiple of 8 bits regardless of the number of chan- 
nels, even if each extra word of each channel consists of 
4 bits. Even when extra samples are to be discarded in 
a low-class decoder, therefore, 8-bit processing is pos- 
sible. 

As the main words of extra samples each consist of 
16 bits, they can all be processed 8 bits by 8 bits, which 
is very advantageous in designing a specific circuit. 

Each audio pack has a pack header. As shown in 
FIG. 1 7, the pack header consists of a pack start code 
(4 bytes), a system clock reference (SCR) (6 bytes), a a 
program multiplexing rate (3 bytes) and a pack stuffing 
length (1 byte). The SCR represents the time required 
to fetch this audio pack. If the value of the SCR repre- 
sents is shorter than a reference value in the disk play- 
ing apparatus, the audio pack will be stored into the 
audio buffer. The control circuit refers to the pack stuff- 
ing length and determines an read address on the basis 
of the pack stuffing length. 

FIG. 18 shows the contents of the packet header of 
an audio packet. The packet header includes a 
packet_start_code prefix indicative of the start of a 
packet, a stream ID indicating what data the packet has. 
and data indicative of the length of the packet stream. 
Also described in the packet header are various kinds of 
information of packet elementary stream (PES), such as 
a flag indicating the inhibition or permission of copy, a 
flag indicating if the information is original one or copied 
one and the length of the packet header. A presentation 
time stamp (PTS) for synchronization of the output tim- 
ing of this packet with that of other video data or sub pic- 
ture is further described in the packet header. Further, 
information, such as a flag indicating rf there is any 
description on a buffer and the buffer size, is described 
in the first packet in the first field in each video object. 

The packet header also has stuffing bytes of 0 to 7 
bytes. The packet header further has a sub stream ID 
indicating an audio stream, linear PCM or other com- 
pressing type and the number of audio stream. Further 



described in the packet header are the number of 
frames of audio data whose first byte is located in this 
packet. Furthermore, a pointer for a unit to be accessed 
first is described by the number of logic blocks from the 

s last byte of this information. Thus, the pointer indicates 
the first audio frame to be decorded first at the time 
described by the PTS. The pointer indicates the first 
byte address of that audio frame. Further described in 
the packet header are an audio emphasis flag indicating 

w whether or not to be emphasized high frequency band, 
a mute flag for providing mute when audio frame data 
are all 0, and a frame number indicative of the frame in 
an audio frame group (GOF) which should be accessed 
first. Control information, such as the length of a quan- 

15 tized word or the number of quantization bits, the sam- 
pling frequency, the number of channels and the 
dynamic range, is also described. 

The above header information is analyzed by a 
decoder control section (not shown) in the audio 

20 decoder. The decoder control section switches the sig- 
nal processing circuit in the decoder to the signal 
processing mode which is associated with currently 
acquired audio data. The switched modes are as dis- 
cussed with reference to FIGS. 13 to 16. Information like 

25 this header information is also described in the video 
manager, so that when such information is read at the 
initial stage of the reproducing operation, the informa- 
tion need not be read again thereafter for the reproduc- 
tion of the same sub stream. The reason why mode 

30 information necessary to reproduce audio data is 
described in the header of each packet as mentioned 
above is because a receiving terminal can identify the 
mode of the audio data whenever reception starts in the 
case a sequence of packets is transferred by a commu- 

35 nication path. 

FIG. 19 is a block diagram of the audio data 
processing system incorporated in the disk playing 
apparatus, illustrating the system processing section 
504 and the audio decoder 513 in more detail than FIG. 

40 10. 

In the system processing section 504, an input 
high-frequency signal (read signal) is supplied to a sync 
detector 601. The detector 601 detects and extracts a 
sync signal from the read signal and generates a timing 

45 signal. The read signal now containing no sync signal is 
input to a 8-16 demodulator 602, which demodulates 
the 16-bit signal into a train of 8-bit data. The 8-bit data 
is input to a error correcting circuit 603. The data output 
from the circuit 603. which is free of errors, is input to a 

so demultiplexer 604. The demultiplexer 604 processes the 
data, recognizing the video pack, the sub-picture pack, 
and the audio pack according to the reference of the 
stream ID. These packs are supplied from the demulti- 
plexer 604 to the video decoder 508, the sub-picture 

55 decoder 509 and the audio decoder 51 3. 

Meanwhile, the audio pack is fetched into an audio 
buffer 611 , and the pack header and packet header of 
the audio pack are fetched into a control circuit 612. The 
control circuit 612 recognizes the contents of the audio 
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pack, i.e., the start code, stuffing length, packet start 
code and stream ID of the audio pack Further, the con- 
trol circuit 612 recognizes the sub-stream ID, the first 
access point, number of quantized audio bits, number of 
channels and sampling frequency. The stuffing byte 
length and the padding packet length are determined 
from these data items thus recognized, on the basis of 
the table shown in FIG. 4. 

The control circuit 612 recognizes the packet of lin- 
ear PCM based on the sub-stream ID. 

As a result, the control circuit 612 can identify the 
extraction address of the audio data stored in the audio 
buffer 61 1 . When controlled by the circuit 61 2, the audio 
buffer 61 1 outputs samples such as samples SO, S1 , 

eO, e1 , S2, S3 The control circuit 612 can recognize 

the number of stuffing bytes and/or the number of pad- 
ding packets once after it has recognized at least the 
number of quantized bits, the sampling frequency and 
the number of audio channels. The circuit 612 can 
extract data on the basis of these data items thus recog- 
nized. 

The samples output from the audio buffer 61 1 are 
supplied to a channel processor 613. The processor 
61 3 has a structure of the type shown in FIGS. 1 3 to 1 6. 
Its operating mode is controlled by the control circuit 
612. 

The audio packet, the video packet, the sub-picture 
packet and the recording tracks of the optical disk, all 
described above, have a specific physical relationship, 
which will be explained below. 

When a part of the recording surface of an optical 
disk 10 shown in FIG. 20A is magnified, trains of pits are 
seen as illustrated in FIG. 20B. A set of pit trains consti- 
tute a sector as seen from FIGS. 20C and 20D which 
are two other magnified views of the optical disk 10. The 
sectors are sequential read by the optical head. Then 
the audio packets are reproduced in Real time. 

The sectors will be described with reference to 
FIGS. 21 A and 21 B. As shown in FIG. 21 B, a sector in 
which audio data is recorded, consists of 13 x 2 frames. 
One sync code is assigned to each sector. Although the 
frames are shown in FIG. 21 B as if sequentially 
arranged in rows and columns, they are sequentially 
arranged in a single row on one track. More specifically, 
the frames having sync codes SYO, SY5, SY1 , SYS, 
SY2, SYS, ... are arranged in the order they are men- 
tioned. 

The sync code assigned to one frame consists of 
32 bits (1 6 bits x 2), and the data recorded in one frame 
consists of 1456 bits (16 bits x 91). This means that the 
sector is expressed by 16-bit modulated code, since 16- 
bit data items obtained by modulating 8-bit data items 
are recorded on the optical disk. Also recorded each 
sector includes a modulated error-correction code. 

FIG. 22A shows a sector in which there are 8-bit 
data items obtained by demodulating the 16-bit data 
items recorded in the physical sector described above. 
The amount of data in this sector is: (1 72 + 19) bytes x 
(12+1) lines. Each line contains a 10-byte error-correc- 



tion code. One correction code is provided for each line. 
When twelve correction codes for twelve lines, respec- 
tively, are collected, they functions an error-correction 
code for twelve columns. 

s The data recorded in one recording/recorded sector 

becomes a data block of the type shown in FIG. 22B 
when the error-correction signal is removed from it. The 
data block consists of 2048-byte main data, 6-byte sec- 
tor ID, a 2-byte ID error-detection code, 6-byte copyright 

10 management data, and a 4-byte error-detection code. 
As FIG. 22B shows, the sector ID, ID error-detection 
code and the copyright management data are added to 
the head of the main data, whereas the error-detection 
code is added to the end of the main data. The 2048- 

15 byte main data is one pack defined above. A pack 
header, packet header and audio data are described in 
the pack, in the order mentioned from the head of the 
pack. In the pack header and the packet header there 
are described various items of guide information which 

20 will be used to process the audio data. 

As indicated above, one packet which consists of 
audio samples arranged in a specific way is recorded in 
each recording/recorded sector on the disk The audio 
decoder can reproduce linear PCM data in a desired 

25 manner despite that the PCM data is recorded in one 
recording/recorded sector. This is because the start 
part of the audio data contained in any pack is the start 
part of the main sample, and also because the pack 
header and the packet header contain control data suf- 

30 f icient for the audio decoder to process audio data. 

An error-correction code (ECC) block will be 
described, with reference to FIGS. 23A and 23B. 

As shown in FIG. 23A, the ECC block consists of 1 6 
recording/recorded sectors. As shown in FIG. 22A, 

35 each sector can record 12 lines of data, each line being 
a 127-byte data item. A 16-byte outer parity (PO) is 
added to each column, and a 10-byte inner parity (PI) is 
added to each line. As shown in FIG. 23B, the 16-byte 
outer parity (PO) is distributed, one bit to each line. As a 

40 result, one recording/recorded sector holds 13 lines (1 2 
+ 1) of data. In FIG. 27A, "BO, 0, B0, 1, ..." indicate 
addresses of bytes. In FIG. 27B, "0, 1, 2, ... 15" desig- 
nate the 16 recording/recorded sectors, respectively. 
The video packs, sub picture packs and audio 

45 packs are interlaced on the track of the disk. However, 
this invention is not limited to this arrangement of the 
packs. This invention can be applied to the disk which 
only the audio packs are arranged, or the disk which the 
audio packs and sub packs are arranged, or the disk 

so which the audio packs, sub packs and NV packs are 
arranged. It is free to combine the packs each other. 

Claims 

55 1. A packing method for variable length data, charac- 
terized in that in storing plural pieces of sample 
data of different data lengths in a packet of prede- 
termined bytes in accordance with the number of 
channels, other pieces of sample data in said plural 
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pieces of sample data than first sample data are 
sequentially arranged with a top of said first sample 
data being positioned at a predetermined position 
of said packet, a total byte length of said sample 
data is equal to or smaller than a maximum byte 
length of said packet, and when a byte length of 
said packet is less than said maximum byte length, 
invalid data of a stuffing byte or a padding byte is 
inserted in a remaining portion. 

2. The packing method according to claim 1, charac- 
terized in that said plural pieces of sample data are 
linear PCM data and said maximum byte length is 
2013 bytes. 

3. A recording medium recording a sequence of pack- 
ets in such a way that in storing plural pieces of 
sample data of different data lengths in a packet of 
predetermined bytes in accordance with the 
number of channels, other pieces of sample data in 
said plural pieces of sample data than first sample 
data are sequentially arranged with a top of said 
first sample data being positioned at a predeter- 
mined position of said packet, a total byte length of 
said sample data is equal to or smaller than a max- 
imum byte length of said packet, and when a byte 
length of said packet is less than said maximum 
byte length, invalid data of a stuffing byte or a pad- 
ding byte is inserted in a remaining portion. 

4. The recording medium according to claim 3, char- 
acterized in that said plural pieces of sample data 
are linear PCM data and said maximum byte length 
is 20 13 bytes. 

5. The recording medium according to claim 3, char- 
acterized in that when said total byte length is less 
than said maximum byte length and said remaining 
portion has a length of 7 bytes or less, said stuffing 
byte is inserted in a packet header, and when said 
total byte length is equal to or greater than said 
maximum byte length, said padding byte is inserted 
at an end portion of said packet. 

6. The recording medium according to claim 3, char- 
acterized in that an even number of pieces of sam- 
ple data are arranged in one packet. 

7. The recording medium according to claim 4, char- 
acterized in that said stuffing byte is given to a 
packet including a head of an audio frame, and said 
padding byte is given to a packet which does not 
include a head of an audio frame. 



pieces of sample data in said plural pieces of sam- 
ple data than first sample data are sequentially 
arranged with a top of said first sample data being 
positioned at a predetermined position of said 

5 packet, a total byte length of said sample data is 
equal to or smaller than a maximum byte length of 
said packet, and when a byte length of said packet 
is less than said maximum byte length, invalid data 
of a stuffing byte or a padding byte is inserted in a 

10 remaining portion. 

9. A packet processing apparatus comprising: 

an input terminal for receiving a sequence of 
is packets structured in such a way that in storing 

plural pieces of sample data of different data 
lengths in a packet of predetermined bytes in 
accordance with the number of channels, other 
pieces of sample data in said plural pieces of 
20 sample data than first sample data are sequen- 

tially arranged with a top of said first sample 
data being positioned at a predetermined posi- 
tion of said packet, a total byte length of said 
sample data is equal to or smaller than a maxi- 
25 mum byte length of said packet, and when a 

byte length of said packet is less than said 
maximum byte length, invalid data of a stuffing 
byte or a padding byte is inserted in a remain- 
ing portion; 

30 an input switch for separating said sample data 

from said input terminal channel by channel 
and outputting sample words of each channel; 
a plurality of memories for storing sample 
words of said individual channels; and 

35 a plurality of output switches for reading sam- 

ple words of said individual channels from said 
memories of said individual channels. 

10. Trie packet processing apparatus according to 
40 claim 9, characterized in that said input switch has 

distribution terminals associated with all channels 
included in samples in said packet. 

11. The packet processing apparatus according to 
45 claim 9, characterized in that said input switch has 

distribution terminals associated with all channels 
included in samples in said packet, and some 
groups of said distribution terminals are connected 
to said memories while other groups of said distri- 
so bution terminals are grounded or open. 



A transmitting and receiving apparatus for receiving ss 
a sequence of packets structured in such a way that 
in storing plural pieces of sample data of different 
data lengths in a packet of predetermined bytes in 
accordance with the number of channels, other 
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